Data Schema
What is the standardized schema of OmixAtlas?
Data schema: the data available within OmixAtlas is curated within defined indexes on the basis of the information it contains. These indexes are:
- Dataset-level metadata (index: files): Contains curated fields like drug, disease, tissue organism, etc., for each dataset.
- Sample-level metadata (index: gct_metadata, h5ad_metadata, and biom_metadata): Contains curated fields like cell lines, experimental design, etc., for each sample.
- Feature level metadata (gct_row_metadata, h5ad_data, and biom_data): Contains the gene/molecule symbol along with the feature intensity for each sample.
- Variant-related data (index: variant_data): Contains the schema for variant-related information present in vcf files